MSU 4.1-539.ST25
SEQUENCE LISTING 

<110> Sticklen, Masomeh B 
Maqbool, Shahina B
Dale, Bruce E 

<120> TRANSGENIC PLANTS CONTAINING LIGNINASE AND CELLULASE WHICH DEGRADE
LIGNIN AND CELLULOSE TO FERMENTABLE SUGARS

<130> MSU 4.1-539

<150> 60/242,408 
<151> 2000-10-20 

<160> 19 

<170> PatentIn version 3.1

<210> 1 

<211> 1110 

<212> DNA 

<213> Oryza sativa

<400> 1





gggtcggaga tgccaccacg gccacaaccc acgagcccgg cgcgacacca ccgcgcgcgt 60
tgagccagcc acaaacgccc gcggataggc gcgccgcacg cggccaatcc taccacatcc 120
ccggcctccg cggctcgagc gccgtgccat ccgatccgct gagttttggc tatttatacg 180
taccgcggga gcctgtgtgc agagagtgca tctcaagaag tactcgagca aagaaggaga 240
gagcttggtg agctgcagag atggccccct ccgtgatggc gtcgtcggcc accaccgtcg 300
ctcccttcca gggctcaagt ccaccgccgg catgccgtcg cccgccgtcc gaactccagc 360
ttcggcaacg tcagcatggc ggcaggatca ggtgcatgca ggtaattacc tactgatcca 420
acacacattc ttcttcttct tcttcttctt aaccaacatt aaccaacaac tcaattatcg 480
tttattcatt gaggtgtggc cgattgaggg catcaagaag ttcgagaccc tctcctacct 540
gccaccgctc accgtggagg acctcctgaa gcagatcgag tacctagctc cgttccaagt 600
ggtgccctgc ctcgagttca gcaaggtcgg atttgtctac cgtgagaacc acaagtcccc 660
tggatactac gacggcaggt actggaccat gtggaagctg cccatgttcg ggtgcaccga 720
cgccacccag gtcgtcaagg agctcgagga ggccaagaag gcgtaccctg atgcattcgt 780
ccgtatcatc ggcttcgaca acgttaggca ggtgcagctc atcagcttca tcgcctacaa 840
cccgggctgc gaggagtctg gtggcaacta agccgtcatc gtcatatata gcctcgttta 900
attgttcatc tctgattcga tgatgtctcc caccttgttt cgtgtgttcc cagtttgttt 960
catcgtcttt tgattttacc ggccgtgctc tgcttttgtt ttttcttttc acctgattct 1020
ctctctgact tgatgtaaga gtggtatctg ctacgactat atgttgtttg ggtgaggcat 1080
atgtgaatga aatctatgaa agctccggct 1110



<210> 2 

<211> 38 

<212> PRT 

<213> Oryza sativa 

<400> 2 

Met Ala Pro Ser Val Met Ala Ser Ser Ala Thr Thr Val Ala Pro Phe
1 5 10 15

Gln Gly Ser Ser Pro Pro Pro Ala Cys Arg Arg Pro Pro Ser Glu Leu
20 25 30

Gln Leu Arg Gln Arg Gln
35



<210> 3 

<211> 6 

<212> PRT 

<213> synthetic peptide 
<220> 

<221> SIGNAL 

<222> (1)..(6)

<223> targets the peroxisomes of plants 

<400> 3 

Arg Ala Val Ala Arg Leu 
1 5 



<210> 4 
<211> 3004 
<212> DNA 

<213> Acidothermus cellulolyticus 
<220> 

<221> CDS 

<222> (824)..(2512)

<223> E1 beta-1,4-endoglucanase precursor
<400> 4 

ggatccacgt tgtacaaggt cacctgtccg tcgttctggt agagcggcgg gatggtcacc
cgcacgatct ctcctttgtt gatgtcgacg gtcacgtggt tacggtttgc ctcggccgcg
attttcgcgc tcgggcttgc tccggctgtc gggttcggtt tggcgtggtg tgcggagcac
gccgaggcga tcccaatgag ggcaagggca agagcggagc cgatggcacg tcgggtggcc
gatggggtac gccgatgggg cgtggcgtcc ccgccgcgga cagaaccgga tgcggaatag
gtcacggtgc gacatgttgc cgtaccgcgg acccggatga caagggtggg tgcgcgggtc
gcctgtgagc tgccggctgg cgtctggatc atgggaacga tcccaccatt ccccgcaatc
gacgcgatcg ggagcagggc ggcgcgagcc ggaccgtgtg gtcgagccgg acgattcgcc

catacggtgc tgcaatgccc agcgccatgt tgtcaatccg ccaaatgcag caatgcacac 

atggacaggg attgtgactc tgagtaatga ttggattgcc ttcttgccgc ctacgcgtta 

cgcagagtag gcgactgtat gcggtaggtt ggcgctccag ccgtgggctg gacatgcctg 

ctgcgaactc ttgacacgtc tggttgaacg cgcaatactc ccaacaccga tgggatcgtt 

cccataagtt tccgtctcac aacagaatcg gtgcgccctc atgatcaacg tgaaaggagt 

acgggggaga acagacgggg gagaaaccaa cgggggattg gcg gtg ccg cgc gca 

Val Pro Arg Ala 
1 

ttg cgg cga gtg cct ggc tcg cgg gtg atg ctg cgg gtc ggc gtc gtc
Leu Arg Arg Val Pro Gly Ser Arg Val Met Leu Arg Val Gly Val Val
5 10 15 20

gtc gcg gtg ctg gca ttg gtt gcc gca ctc gcc aac cta gcc gtg ccg
Val Ala Val Leu Ala Leu Val Ala Ala Leu Ala Asn Leu Ala Val Pro
25 30 35

cgg ccg gct cgc gcc gcg ggc ggc ggc tat tgg cac acg agc ggc cgg
Arg Pro Ala Arg Ala Ala Gly Gly Gly Tyr Trp His Thr Ser Gly Arg
40 45 50

gag atc ctg gac gcg aac aac gtg ccg gta cgg atc gcc ggc atc aac
Glu Ile Leu Asp Ala Asn Asn Val Pro Val Arg Ile Ala Gly Ile Asn
55 60 65

tgg ttt ggg ttc gaa acc tgc aat tac gtc gtg cac ggt ctc tgg tca
Trp Phe Gly Phe Glu Thr Cys Asn Tyr Val Val His Gly Leu Trp Ser
70 75 80

cgc gac tac cgc agc atg ctc gac cag ata aag tcg ctc ggc tac aac
Arg Asp Tyr Arg Ser Met Leu Asp Gln Ile Lys Ser Leu Gly Tyr Asn
85 90 95 100

aca atc cgg ctg ccg tac tct gac gac att ctc aag ccg ggc acc atg
Thr Ile Arg Leu Pro Tyr Ser Asp Asp Ile Leu Lys Pro Gly Thr Met
105 110 115

ccg aac agc atc aat ttt tac cag atg aat cag gac ctg cag ggt ctg
Pro Asn Ser Ile Asn Phe Tyr Gln Met Asn Gln Asp Leu Gln Gly Leu
120 125 130

acg tcc ttg cag gtc atg gac aaa atc gtc gcg tac gcc ggt cag atc
Thr Ser Leu Gln Val Met Asp Lys Ile Val Ala Tyr Ala Gly Gln Ile
135 140 145

ggc ctg cgc atc att ctt gac cgc cac cga ccg gat tgc agc ggg cag
Gly Leu Arg Ile Ile Leu Asp Arg His Arg Pro Asp Cys Ser Gly Gln
150 155 160

tcg gcg ctg tgg tac acg agc agc gtc tcg gag gct acg tgg att tcc
Ser Ala Leu Trp Tyr Thr Ser Ser Val Ser Glu Ala Thr Trp Ile Ser
165 170 175 180

gac ctg caa gcg ctg gcg cag cgc tac aag gga aac ccg acg gtc gtc
Asp Leu Gln Ala Leu Ala Gln Arg Tyr Lys Gly Asn Pro Thr Val Val
185 190 195



ggc ttt gac ttg cac aac gag ccg cat gac ccg gcc tgc tgg ggc tgc 1459
Gly Phe Asp Leu His Asn Glu Pro His Asp Pro Ala Cys Trp Gly Cys
200 205 210

ggc gat ccg agc atc gac tgg cga ttg gcc gcc gag cgg gcc gga aac 1507
Gly Asp Pro Ser Ile Asp Trp Arg Leu Ala Ala Glu Arg Ala Gly Asn
215 220 225

gcc gtg ctc tcg gtg aat ccg aac ctg ctc att ttc gtc gaa ggt gtg 1555
Ala Val Leu Ser Val Asn Pro Asn Leu Leu Ile Phe Val Glu Gly Val
230 235 240

cag agc tac aac gga gac tcc tac tgg tgg ggc ggc aac ctg caa gga 1603
Gln Ser Tyr Asn Gly Asp Ser Tyr Trp Trp Gly Gly Asn Leu Gln Gly
245 250 255 260

gcc ggc cag tac ccg gtc gtg ctg aac gtg ccg aac cgc ctg gtg tac 1651
Ala Gly Gln Tyr Pro Val Val Leu Asn Val Pro Asn Arg Leu Val Tyr
265 270 275

tcg gcg cac gac tac gcg acg agc gtc tac ccg cag acg tgg ttc agc 1699
Ser Ala His Asp Tyr Ala Thr Ser Val Tyr Pro Gln Thr Trp Phe Ser
280 285 290

gat ccg acc ttc ccc aac aac atg ccc ggc atc tgg aac aag aac tgg 1747
Asp Pro Thr Phe Pro Asn Asn Met Pro Gly Ile Trp Asn Lys Asn Trp
295 300 305

gga tac ctc ttc aat cag aac att gca ccg gta tgg ctg ggc gaa ttc 1795
Gly Tyr Leu Phe Asn Gln Asn Ile Ala Pro Val Trp Leu Gly Glu Phe
310 315 320

^ s f acg aca ctg caa tcc acg acc gac cag acg tgg ctg aag acg ctc 1843
Gly Thr Thr Leu Gln Ser Thr Thr Asp Gln Thr Trp Leu Lys Thr Leu
325 330 335 340

gtc cag tac cta cgg ccg acc gcg caa tac ggt gcg gac agc ttc cag 1891
Val Gln Tyr Leu Arg Pro Thr Ala Gln Tyr Gly Ala Asp Ser Phe Gln
345 350 355

tgg acc ttc tgg tcc tgg aac ccc gat tcc ggc gac aca gga gga att 1939
Trp Thr Phe Trp Ser Trp Asn Pro Asp Ser Gly Asp Thr Gly Gly Ile
360 365 370

ctc aag gat gac tgg cag acg gtc gac aca gta aaa gac ggc tat ctc 1987
Leu Lys Asp Asp Trp Gln Thr Val Asp Thr Val Lys Asp Gly Tyr Leu
375 380 385

gcg ccg atc aag tcg tcg att ttc gat cct gtc ggc gcg tct gca tcg 2035
Ala Pro Ile Lys Ser Ser Ile Phe Asp Pro Val Gly Ala Ser Ala Ser
390 395 400

cct agc agt caa ccg tcc ccg tcg gtg tcg ccg tct ccg tcg ccg agc 2083
Pro Ser Ser Gln Pro Ser Pro Ser Val Ser Pro Ser Pro Ser Pro Ser
405 410 415 420

ccg tcg gcg agt cgg acg ccg acg cct act ccg acg ccg aca gcc agc 2131
Pro Ser Ala Ser Arg Thr Pro Thr Pro Thr Pro Thr Pro Thr Ala Ser
425 430 435

ccg acg cca acg ctg acc cct act gct acg ccc acg ccc acg gca agc 2179
Pro Thr Pro Thr Leu Thr Pro Thr Ala Thr Pro Thr Pro Thr Ala Ser
440 445 450

ccg acg ccg tca ccg acg gca gcc tcc gga gcc cgc tgc acc gcg agt 2227
Pro Thr Pro Ser Pro Thr Ala Ala Ser Gly Ala Arg Cys Thr Ala Ser
455 460 465

tac cag gtc aac agc gat tgg ggc aat ggc ttc acg gta acg gtg gcc 2275
Tyr Gln Val Asn Ser Asp Trp Gly Asn Gly Phe Thr Val Thr Val Ala
470 475 480

gtg aca aat tcc gga tcc gtc gcg acc aag aca tgg acg gtc agt tgg 2323
Val Thr Asn Ser Gly Ser Val Ala Thr Lys Thr Trp Thr Val Ser Trp
485 490 495 500

aca ttc ggc gga aat cag acg att acc aat tcg tgg aat gca gcg gtc 2371
Thr Phe Gly Gly Asn Gln Thr Ile Thr Asn Ser Trp Asn Ala Ala Val
505 510 515

acg cag aac ggt cag tcg gta acg gct cgg aat atg agt tat aac aac 2419
Thr Gln Asn Gly Gln Ser Val Thr Ala Arg Asn Met Ser Tyr Asn Asn
520 525 530

gtg att cag cct ggt cag aac acc acg ttc gga ttc cag gcg agc tat 2467
Val Ile Gln Pro Gly Gln Asn Thr Thr Phe Gly Phe Gln Ala Ser Tyr
535 540 545

acc gga agc aac gcg gca ccg aca gtc gcc tgc gca gca agt taa 2512
Thr Gly Ser Asn Ala Ala Pro Thr Val Ala Cys Ala Ala Ser
550 555 560



tacgtcgggg agccgacggg agggtccgga ccgtcggttc cccggcttcc acctatggag 2572
cgaacccaac aatccggacg gaactgcagg taccagagag gaacgacacg aatgcccgcc 2632
atctcaaaac ggctgcgagc cggcgtcctc gccggggcgg tgagcatcgc agcctccatc 2692
gtgccgctgg cgatgcagca tcctgccatc gccgcgacgc acgtcgacaa tccctatgcg 2752
ggagcgacct tcttcgtcaa cccgtactgg gcgcaagaag tacagagcga acggcgaacc 2812
agaccaatgc cactctcgca gcgaaaatgc gcgtcgtttc cacatattcg acggccgtct 2872
ggatggaccg catcgctgcg atcaacggcg tcaacggcgg acccggcttg acgacatatc 2932
tggacgccgc cctctcccag cagcagggaa ccacccctga agtcattgag attgtcatct 2992

acgatctgcc gg 3004 

<210> 5 
<211> 562 
<212> PRT 

<213> Acidothermus cellulolyticus
<400> 5 

Val Pro Arg Ala Leu Arg Arg Val Pro Gly Ser Arg Val Met Leu Arg
1 5 10 15

Val Gly Val Val Val Ala Val Leu Ala Leu Val Ala Ala Leu Ala Asn
20 25 30

Leu Ala Val Pro Arg Pro Ala Arg Ala Ala Gly Gly Gly Tyr Trp His
35 40 45

Thr Ser Gly Arg Glu Ile Leu Asp Ala Asn Asn Val Pro Val Arg Ile
50 55 60

Ala Gly Ile Asn Trp Phe Gly Phe Glu Thr Cys Asn Tyr Val Val His
65 70 75 80

Gly Leu Trp Ser Arg Asp Tyr Arg Ser Met Leu Asp Gln Ile Lys Ser
85 90 95

Leu Gly Tyr Asn Thr Ile Arg Leu Pro Tyr Ser Asp Asp Ile Leu Lys
100 105 110

Pro Gly Thr Met Pro Asn Ser Ile Asn Phe Tyr Gln Met Asn Gln Asp
115 120 125

Leu Gln Gly Leu Thr Ser Leu Gln Val Met Asp Lys Ile Val Ala Tyr
130 135 140

Ala Gly Gln Ile Gly Leu Arg Ile Ile Leu Asp Arg His Arg Pro Asp
145 150 155 160

Cys Ser Gly Gln Ser Ala Leu Trp Tyr Thr Ser Ser Val Ser Glu Ala
165 170 175

Thr Trp Ile Ser Asp Leu Gln Ala Leu Ala Gln Arg Tyr Lys Gly Asn
180 185 190

Pro Thr Val Val Gly Phe Asp Leu His Asn Glu Pro His Asp Pro Ala
195 200 205

Cys Trp Gly Cys Gly Asp Pro Ser Ile Asp Trp Arg Leu Ala Ala Glu
210 215 220

Arg Ala Gly Asn Ala Val Leu Ser Val Asn Pro Asn Leu Leu Ile Phe
225 230 235 240

Val Glu Gly Val Gln Ser Tyr Asn Gly Asp Ser Tyr Trp Trp Gly Gly
245 250 255

Asn Leu Gln Gly Ala Gly Gln Tyr Pro Val Val Leu Asn Val Pro Asn
260 265 270

Arg Leu Val Tyr Ser Ala His Asp Tyr Ala Thr Ser Val Tyr Pro Gln
275 280 285

Thr Trp Phe Ser Asp Pro Thr Phe Pro Asn Asn Met Pro Gly Ile Trp
290 295 300

Asn Lys Asn Trp Gly Tyr Leu Phe Asn Gln Asn Ile Ala Pro Val Trp
305 310 315 320

Leu Gly Glu Phe Gly Thr Thr Leu Gln Ser Thr Thr Asp Gln Thr Trp
325 330 335

Leu Lys Thr Leu Val Gln Tyr Leu Arg Pro Thr Ala Gln Tyr Gly Ala
340 345 350

Asp Ser Phe Gln Trp Thr Phe Trp Ser Trp Asn Pro Asp Ser Gly Asp
355 360 365

Thr Gly Gly Ile Leu Lys Asp Asp Trp Gln Thr Val Asp Thr Val Lys
370 375 380

Asp Gly Tyr Leu Ala Pro Ile Lys Ser Ser Ile Phe Asp Pro Val Gly
385 390 395 400

Ala Ser Ala Ser Pro Ser Ser Gln Pro Ser Pro Ser Val Ser Pro Ser
405 410 415

Pro Ser Pro Ser Pro Ser Ala Ser Arg Thr Pro Thr Pro Thr Pro Thr
420 425 430

Pro Thr Ala Ser Pro Thr Pro Thr Leu Thr Pro Thr Ala Thr Pro Thr
435 440 445

Pro Thr Ala Ser Pro Thr Pro Ser Pro Thr Ala Ala Ser Gly Ala Arg
450 455 460

Cys Thr Ala Ser Tyr Gln Val Asn Ser Asp Trp Gly Asn Gly Phe Thr
465 470 475 480

Val Thr Val Ala Val Thr Asn Ser Gly Ser Val Ala Thr Lys Thr Trp
485 490 495

Thr Val Ser Trp Thr Phe Gly Gly Asn Gln Thr Ile Thr Asn Ser Trp
500 505 510

Asn Ala Ala Val Thr Gln Asn Gly Gln Ser Val Thr Ala Arg Asn Met
515 520 525

Ser Tyr Asn Asn Val Ile Gln Pro Gly Gln Asn Thr Thr Phe Gly Phe
530 535 540

Gln Ala Ser Tyr Thr Gly Ser Asn Ala Ala Pro Thr Val Ala Cys Ala
545 550 555 560

Ala Ser



<210> 6 

<211> 1467 

<212> DNA 

<213> Actinomyces naeslundii 
<220> 

<221> CDS 

<222> (1)..(1467)

<223> beta-glucosidase

<220>

<221> misc_feature

<222> (339)..(339)

<223> nucleotide is uncertain

<220>

<221> misc_feature

<222> (947)..(947)

<223> nucleotide is uncertain

<220>

<221> misc_feature

<222> (443)..(443)

<223> nucleotide is uncertain

<400> 6 

atg acc gcc acg tcc act act tct aag agc aat ccg aac ttc ccc gac 48
Met Thr Ala Thr Ser Thr Thr Ser Lys Ser Asn Pro Asn Phe Pro Asp
1 5 10 15

ggc ttc ctg tgg ggc ggg gcc acc gcc gcc aac cag atc gag ggc gct 96
Gly Phe Leu Trp Gly Gly Ala Thr Ala Ala Asn Gln Ile Glu Gly Ala
20 25 30

tac aac gag gac ggc aag ggc ctg tcc gtc cag gac gtc atg cct cgg 144
Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val Gln Asp Val Met Pro Arg
35 40 45

ggc atc atg gcc cac ccc acc cag gct ccc aca ccg gat aac ctt caa 192
Gly Ile Met Ala His Pro Thr Gln Ala Pro Thr Pro Asp Asn Leu Gln
50 55 60

gct cga ggc gat cga cct tct acc acc gct tac gcc gag gac atc tcc 240
Ala Arg Gly Asp Arg Pro Ser Thr Thr Ala Tyr Ala Glu Asp Ile Ser
65 70 75 80

ctg ttc gcg gag atg ggt ttc aag gtc ttc cgc ttc tcc atc gcc tgg 288
Leu Phe Ala Glu Met Gly Phe Lys Val Phe Arg Phe Ser Ile Ala Trp
85 90 95

agc cgc atc ttc ccg ctc ggc gac gag acc gag ccc aat gag gaa gga 336
Ser Arg Ile Phe Pro Leu Gly Asp Glu Thr Glu Pro Asn Glu Glu Gly
100 105 110

ctn gcc ttc tac gac cgg gtc ctc gac gag ctc gag aag cac ggg atc 384
Xaa Ala Phe Tyr Asp Arg Val Leu Asp Glu Leu Glu Lys His Gly Ile
115 120 125

gag cca ctg gtc acc atc agc cac tac gag acc ccg ctg cac ctg gcg 432
Glu Pro Leu Val Thr Ile Ser His Tyr Glu Thr Pro Leu His Leu Ala
130 135 140

cgc acc tac gnc ggc tgg acc gac cgc cgc ctc atc ggc ttc ttc gag 480
Arg Thr Tyr Xaa Gly Trp Thr Asp Arg Arg Leu Ile Gly Phe Phe Glu
145 150 155 160

cgc tac gcc cgc acc ctg ttc gag cgc tat ggc aag cgg gtc aag tac 528
Arg Tyr Ala Arg Thr Leu Phe Glu Arg Tyr Gly Lys Arg Val Lys Tyr
165 170 175

tgg ctc acc ttc aac gag atc aac tcc gtg ctc cat gag ccc ttc cta 576
Trp Leu Thr Phe Asn Glu Ile Asn Ser Val Leu His Glu Pro Phe Leu
180 185 190

tct ggg ggc gtc gcc acg ccc aag gac agg ccc ccc gag cag gac ctc 624
Ser Gly Gly Val Ala Thr Pro Lys Asp Arg Pro Pro Glu Gln Asp Leu
195 200 205

tac cag gcc atc caa aac gag ctc gtc gcc tcc gcg gcc gcg acc agg 672
Tyr Gln Ala Ile Gln Asn Glu Leu Val Ala Ser Ala Ala Ala Thr Arg
210 215 220

atc gcc cat gag acc aac ccc gac atc cag gtc ggc tgc atg atc ctg 720
Ile Ala His Glu Thr Asn Pro Asp Ile Gln Val Gly Cys Met Ile Leu
225 230 235 240

gcc gat ccc acc tac ccg ctc acc cct gat ccc cgg gac gtg tgg gcg 768
Ala Asp Pro Thr Tyr Pro Leu Thr Pro Asp Pro Arg Asp Val Trp Ala
245 250 255

gcc aag cag gca gag cgc gcc aac tac gcc ttc gga gac ctc cac gta 816
Ala Lys Gln Ala Glu Arg Ala Asn Tyr Ala Phe Gly Asp Leu His Val
260 265 270

cgt ggt gag tac ccc gga tac ctg cgg cgg acc ctg cgg gac aag ggc 864
Arg Gly Glu Tyr Pro Gly Tyr Leu Arg Arg Thr Leu Arg Asp Lys Gly
275 280 285

atc gag ctg gag atc acc gag gag gac cgc gtg ctg ctg cgg gag cac 912
Ile Glu Leu Glu Ile Thr Glu Glu Asp Arg Val Leu Leu Arg Glu His
290 295 300

acc gtc gac ttc gtc tcc ttc tcc tac tac atg tnc gtg tgc gag acc 960
Thr Val Asp Phe Val Ser Phe Ser Tyr Tyr Met Xaa Val Cys Glu Thr
305 310 315 320

gtc acc cag tcg gcc gag gcc ggc cgg ggc aac ctc atg ggc ggc gtc 1008
Val Thr Gln Ser Ala Glu Ala Gly Arg Gly Asn Leu Met Gly Gly Val
325 330 335

ccc aat ccc acc ctc gag gcc tcc gag tgg gga tgg cag atc gac ccg 1056
Pro Asn Pro Thr Leu Glu Ala Ser Glu Trp Gly Trp Gln Ile Asp Pro
340 345 350

gcg ggc ctg cgc acc atc ctg aac gac tac tgg gac cgc tgg ggc aag 1104
Ala Gly Leu Arg Thr Ile Leu Asn Asp Tyr Trp Asp Arg Trp Gly Lys
355 360 365

cct ctg ttc atc gtc gag aac ggc ctg gga gcc aag gac gtc ctc gtt 1152
Pro Leu Phe Ile Val Glu Asn Gly Leu Gly Ala Lys Asp Val Leu Val
370 375 380

gac gga ccc aac ggt ccc acg gtc gag gac gac tac cgc atc gcc tac
Asp Gly Pro Asn Gly Pro Thr Val Glu Asp Asp Tyr Arg Ile Ala Tyr
385 390 395 400

atg aac gac cac ctg gtc cag gtc gcc gag gcc att gcc gac ggc gtc
Met Asn Asp His Leu Val Gln Val Ala Glu Ala Ile Ala Asp Gly Val
405 410 415

gag gtc ctg ggc tac acc tcc tgg ggc tgc atc gac ctg gtc tcg gcc
Glu Val Leu Gly Tyr Thr Ser Trp Gly Cys Ile Asp Leu Val Ser Ala
420 425 430

tcc acc gcc cag atg tcc aag cgc tac ggg ttc atc tac gtg gac cgt
Ser Thr Ala Gln Met Ser Lys Arg Tyr Gly Phe Ile Tyr Val Asp Arg
435 440 445

gac gac ggc ggc aac ggc acc ctg gcc cgc tac cgc aag aag tcc ttc
Asp Asp Gly Gly Asn Gly Thr Leu Ala Arg Tyr Arg Lys Lys Ser Phe
450 455 460

ggc tgg tac cgc gac gtc atc gcc tcc aac ggt gcc tcc ctc gtg cct
Gly Trp Tyr Arg Asp Val Ile Ala Ser Asn Gly Ala Ser Leu Val Pro
465 470 475 480

ccg gtg cag gaa ccg ccg cgg ggg tag
Pro Val Gln Glu Pro Pro Arg Gly
485



<210> 7 

<211> 488 

<212> PRT 

<213> Actinomyces naeslundii 
<220> 

<221> misc_feature 

<222> (113)..(113)

<223> The 'Xaa' at location 113 stands for Leu.
<220>

<221> misc_feature

<222> (148)..(148)

<223> The 'Xaa' at location 148 stands for Asp, Gly, Ala, or Val
<220>

<221> misc_feature

<222> (316)..(316)

<223> The 'Xaa' at location 316 stands for Tyr, Cys, Ser, or Phe
<220>

<221> misc_feature

<222> (339)..(339)

<223> nucleotide is uncertain

<220>

<221> misc_feature

<222> (947)..(947)

<223> nucleotide is uncertain

<220>

<221> misc_feature

<222> (443)..(443)

<223> nucleotide is uncertain



<400> 7 

Met Thr Ala Thr Ser Thr Thr Ser Lys Ser Asn Pro Asn Phe Pro Asp
1 5 10 15

Gly Phe Leu Trp Gly Gly Ala Thr Ala Ala Asn Gln Ile Glu Gly Ala
20 25 30

Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val Gln Asp Val Met Pro Arg
35 40 45

Gly Ile Met Ala His Pro Thr Gln Ala Pro Thr Pro Asp Asn Leu Gln
50 55 60

Ala Arg Gly Asp Arg Pro Ser Thr Thr Ala Tyr Ala Glu Asp Ile Ser
65 70 75 80

Leu Phe Ala Glu Met Gly Phe Lys Val Phe Arg Phe Ser Ile Ala Trp
85 90 95

Ser Arg Ile Phe Pro Leu Gly Asp Glu Thr Glu Pro Asn Glu Glu Gly
100 105 110

Xaa Ala Phe Tyr Asp Arg Val Leu Asp Glu Leu Glu Lys His Gly Ile
115 120 125

Glu Pro Leu Val Thr Ile Ser His Tyr Glu Thr Pro Leu His Leu Ala
130 135 140

Arg Thr Tyr Xaa Gly Trp Thr Asp Arg Arg Leu Ile Gly Phe Phe Glu
145 150 155 160

Arg Tyr Ala Arg Thr Leu Phe Glu Arg Tyr Gly Lys Arg Val Lys Tyr
165 170 175

Trp Leu Thr Phe Asn Glu Ile Asn Ser Val Leu His Glu Pro Phe Leu
180 185 190

Ser Gly Gly Val Ala Thr Pro Lys Asp Arg Pro Pro Glu Gln Asp Leu
195 200 205

Tyr Gln Ala Ile Gln Asn Glu Leu Val Ala Ser Ala Ala Ala Thr Arg
210 215 220

Ile Ala His Glu Thr Asn Pro Asp Ile Gln Val Gly Cys Met Ile Leu
225 230 235 240

Ala Asp Pro Thr Tyr Pro Leu Thr Pro Asp Pro Arg Asp Val Trp Ala
245 250 255

Ala Lys Gln Ala Glu Arg Ala Asn Tyr Ala Phe Gly Asp Leu His Val
260 265 270

Arg Gly Glu Tyr Pro Gly Tyr Leu Arg Arg Thr Leu Arg Asp Lys Gly
275 280 285

Ile Glu Leu Glu Ile Thr Glu Glu Asp Arg Val Leu Leu Arg Glu His
290 295 300

Thr Val Asp Phe Val Ser Phe Ser Tyr Tyr Met Xaa Val Cys Glu Thr
305 310 315 320

Val Thr Gln Ser Ala Glu Ala Gly Arg Gly Asn Leu Met Gly Gly Val
325 330 335

Pro Asn Pro Thr Leu Glu Ala Ser Glu Trp Gly Trp Gln Ile Asp Pro
340 345 350

Ala Gly Leu Arg Thr Ile Leu Asn Asp Tyr Trp Asp Arg Trp Gly Lys
355 360 365

Pro Leu Phe Ile Val Glu Asn Gly Leu Gly Ala Lys Asp Val Leu Val
370 375 380

Asp Gly Pro Asn Gly Pro Thr Val Glu Asp Asp Tyr Arg Ile Ala Tyr
385 390 395 400

Met Asn Asp His Leu Val Gln Val Ala Glu Ala Ile Ala Asp Gly Val
405 410 415

Glu Val Leu Gly Tyr Thr Ser Trp Gly Cys Ile Asp Leu Val Ser Ala
420 425 430

Ser Thr Ala Gln Met Ser Lys Arg Tyr Gly Phe Ile Tyr Val Asp Arg
435 440 445

Asp Asp Gly Gly Asn Gly Thr Leu Ala Arg Tyr Arg Lys Lys Ser Phe
450 455 460

Gly Trp Tyr Arg Asp Val Ile Ala Ser Asn Gly Ala Ser Leu Val Pro
465 470 475 480

Pro Val Gln Glu Pro Pro Arg Gly
485

<210> 8

<211> 3072 

<212> DNA 

<213> Streptococcus salivarius 
<220> 

<221> CDS 

<222> (392)..(2860)

<223> 1,6-alpha-glucanhydrolase

<400> 8 

aactgaggcc gttgctccag tagcgacaac agaaataggt ccatcaactg ctactgttgc 

gacagatact gcaacaacag cgacagcttc tacaatcttt tcacaagctg tgccagcaga 

aagtgctagc tcagaaacgc ttgtagccag tgaagcacta gctcctgagt cagctgctgt 

ggaaaccatc acatcatcat ctgataatgc tactgaagca ggacgccatt caactgctca 

agtaacacca gttacagaag tgacagagca aaacttgaat ggtgatgcct acttgacaga 

tccagaaaca acaaaagcag cttatagcaa gacagatggt gatattaatt attccgttgt 

tgtgtctaat ccaacagcag aaactaagac g atg act gtc aac ttg aca ctt 

Met Thr Val Asn Leu Thr Leu 
1 5 

caa cat gct tca gaa att atc ggt caa gat aac gtt gac ctt acg cta
Gln His Ala Ser Glu Ile Ile Gly Gln Asp Asn Val Asp Leu Thr Leu
10 15 20

gcg gca gga gct tca gcc aag gtt tca aac ttg aca gta gcg tca gag
Ala Ala Gly Ala Ser Ala Lys Val Ser Asn Leu Thr Val Ala Ser Glu
25 30 35

tgg ttg aca aac aat aca ggt tac ttg gtg aca atc agt gtc aac gat
Trp Leu Thr Asn Asn Thr Gly Tyr Leu Val Thr Ile Ser Val Asn Asp
40 45 50 55

aaa tca ggc aat gtc ttg tca agc aag cgc gct ggc ttg tct gtt gaa
Lys Ser Gly Asn Val Leu Ser Ser Lys Arg Ala Gly Leu Ser Val Glu
60 65 70

gat gat tgg aca gtt ttc cca cgt tac ggt atc gta gca ggt tca cca
Asp Asp Trp Thr Val Phe Pro Arg Tyr Gly Ile Val Ala Gly Ser Pro
75 80 85

act gat caa aac agt att ctt gtt aaa aat ctt gaa gcc tac cgt aaa
Thr Asp Gln Asn Ser Ile Leu Val Lys Asn Leu Glu Ala Tyr Arg Lys
90 95 100

gag ctt gag ctc atg aag tct atg aat atc aac tca tat ttc ttc tat
Glu Leu Glu Leu Met Lys Ser Met Asn Ile Asn Ser Tyr Phe Phe Tyr
105 110 115

gat gct tat aat gaa gct aca gat cct ttc cca gaa ggt gtc gat agc
Asp Ala Tyr Asn Glu Ala Thr Asp Pro Phe Pro Glu Gly Val Asp Ser
120 125 130 135

ttt gtt caa aaa tgg aat acc tgg agt cac act cag gtt gac act aag
Phe Val Gln Lys Trp Asn Thr Trp Ser His Thr Gln Val Asp Thr Lys
140 145 150

gct gtt aaa gaa ttg gtt gat caa gtt cat aag tca ggt gct gtt gcc 892
Ala Val Lys Glu Leu Val Asp Gln Val His Lys Ser Gly Ala Val Ala
155 160 165

atg ctt tat aac atg att tca gca gat tca aat cca aag aat ccg gcc 940
Met Leu Tyr Asn Met Ile Ser Ala Asp Ser Asn Pro Lys Asn Pro Ala
170 175 180

ctt cca ctt gct gct ttg gct tat aac ttc tac gat agc ttt ggt aag 988
Leu Pro Leu Ala Ala Leu Ala Tyr Asn Phe Tyr Asp Ser Phe Gly Lys
185 190 195

aag ggt gaa ccg atg act tac act atc ggt gat aac cca act caa gtt 1036
Lys Gly Glu Pro Met Thr Tyr Thr Ile Gly Asp Asn Pro Thr Gln Val
200 205 210 215

tac tat gat ccg gcg aat cca gat tgg caa aaa tac atc gca ggt gtc 1084
Tyr Tyr Asp Pro Ala Asn Pro Asp Trp Gln Lys Tyr Ile Ala Gly Val
220 225 230

atg aaa tca gct atg gat cgt atg gga ttc gat ggt tgg caa ggt gat 1132
Met Lys Ser Ala Met Asp Arg Met Gly Phe Asp Gly Trp Gln Gly Asp
235 240 245

aca att ggt gac aac cgt gtg act gat tat gag cac cgt aac agc aca 1180
Thr Ile Gly Asp Asn Arg Val Thr Asp Tyr Glu His Arg Asn Ser Thr
250 255 260

gac gag gct gac tca cac atg atg tct gat tca tat gcg tca ttt att 1228
Asp Glu Ala Asp Ser His Met Met Ser Asp Ser Tyr Ala Ser Phe Ile
265 270 275

aat gcc atg aag gac ctc atc ggt gaa aag tac tac atc aca atc aat 1276
Asn Ala Met Lys Asp Leu Ile Gly Glu Lys Tyr Tyr Ile Thr Ile Asn
280 285 290 295

gat gtt aat ggt ggt aat gat gat aaa cta gcc aag gca cgt caa gat 1324
Asp Val Asn Gly Gly Asn Asp Asp Lys Leu Ala Lys Ala Arg Gln Asp
300 305 310

gtt gtt tat aat gag ctt tgg aca aac ggt ggt tca gtt att cca gga 1372
Val Val Tyr Asn Glu Leu Trp Thr Asn Gly Gly Ser Val Ile Pro Gly
315 320 325

cgt atg cag gtt gcc tat ggt gat ttg aaa gca cgt atc gat atg gta 1420
Arg Met Gln Val Ala Tyr Gly Asp Leu Lys Ala Arg Ile Asp Met Val
330 335 340

cgc aat aaa act ggt aaa tca ctt atc gtt ggt gcc tac atg gaa gaa 1468
Arg Asn Lys Thr Gly Lys Ser Leu Ile Val Gly Ala Tyr Met Glu Glu
345 350 355

cca ggg att gat tat act gtt cct ggc gga aaa gca act aac ggt gct 1516
Pro Gly Ile Asp Tyr Thr Val Pro Gly Gly Lys Ala Thr Asn Gly Ala
360 365 370 375

ggt aaa gat gcc ctt gct ggt aaa cca ttg caa gct gat gcg act ctt 1564
Gly Lys Asp Ala Leu Ala Gly Lys Pro Leu Gln Ala Asp Ala Thr Leu
380 385 390

ctc gta gat gcg aca gta gct gca gca ggt ggt tat cac atg tcc att 1612
Leu Val Asp Ala Thr Val Ala Ala Ala Gly Gly Tyr His Met Ser Ile
395 400 405
cca tca gtt gac gag tta gta aac tca gca gct gtt cca gtg gcg ata 2428
Pro Ser Val Asp Glu Leu Val Asn Ser Ala Ala Val Pro Val Ala Ile
665 670 675

gct gtg tca gag acc gca cat gat aag aaa gat gac aac tca gta tct 2476
Ala Val Ser Glu Thr Ala His Asp Lys Lys Asp Asp Asn Ser Val Ser
680 685 690 695

aat acg gat caa ggt aca gta gca tca gat tca atc act aca cca gct 2524
Asn Thr Asp Gln Gly Thr Val Ala Ser Asp Ser Ile Thr Thr Pro Ala
700 705 710

tca gag gct gca agc aca gct gcc tca aca gtc tca tca gaa gta tca 2572
Ser Glu Ala Ala Ser Thr Ala Ala Ser Thr Val Ser Ser Glu Val Ser
715 720 725

gaa agt gta aca gta tca tcg gaa cca tca gaa act gaa aat agt tca 2620
Glu Ser Val Thr Val Ser Ser Glu Pro Ser Glu Thr Glu Asn Ser Ser
730 735 740

gaa gca tca act tca gag tca gca act cca acg acg aca gca att tca 2668
Glu Ala Ser Thr Ser Glu Ser Ala Thr Pro Thr Thr Thr Ala Ile Ser
745 750 755

gaa tca cat gca gta gtt gaa cca gtg gct tct ttg aca gaa tca gag 2716
Glu Ser His Ala Val Val Glu Pro Val Ala Ser Leu Thr Glu Ser Glu
760 765 770 775

agt cag gca agc act agc ctt gtt tca gaa act aca agc aca att gtc 2764
Ser Gln Ala Ser Thr Ser Leu Val Ser Glu Thr Thr Ser Thr Ile Val
780 785 790

tca gtt gct ccg tca gaa gta tca gaa agc aca tca gag gaa gtc atc 2812
Ser Val Ala Pro Ser Glu Val Ser Glu Ser Thr Ser Glu Glu Val Ile
795 800 805

ctt atg gac tat cag aaa aca tca ata gtt gga ata gac tct ctg tag 2860
Leu Met Asp Tyr Gln Lys Thr Ser Ile Val Gly Ile Asp Ser Leu
810 815 820



ctcctcgcgt ctcagaaacc ttaccaagta cttctgaaac gattacagaa gcagcatcac 2920 

tctttagcaa ctatgcaaga tattcagaaa cagcaagctc agaatctcac tctatggtag 2980

cagcttcttc agaagtttct attgaaaaat tagcagtatc tatcttgaaa gatactgagg 3040 

gaggcttgta tgatgcaaca acaatcagaa at 3072

<210> 9 
<211> 822 
<212> PRT 

<213> Streptococcus salivarius 
<400> 9 

Met Thr Val Asn Leu Thr Leu Gln His Ala Ser Glu Ile Ile Gly Gln
1 5 10 15

Asp Asn Val Asp Leu Thr Leu Ala Ala Gly Ala Ser Ala Lys Val Ser
20 25 30

Asn Leu Thr Val Ala Ser Glu Trp Leu Thr Asn Asn Thr Gly Tyr Leu
35 40 45

Val Thr Ile Ser Val Asn Asp Lys Ser Gly Asn Val Leu Ser Ser Lys
50 55 60

Arg Ala Gly Leu Ser Val Glu Asp Asp Trp Thr Val Phe Pro Arg Tyr
65 70 75 80

Gly Ile Val Ala Gly Ser Pro Thr Asp Gln Asn Ser Ile Leu Val Lys
85 90 95

Asn Leu Glu Ala Tyr Arg Lys Glu Leu Glu Leu Met Lys Ser Met Asn
100 105 110

Ile Asn Ser Tyr Phe Phe Tyr Asp Ala Tyr Asn Glu Ala Thr Asp Pro
115 120 125

Phe Pro Glu Gly Val Asp Ser Phe Val Gln Lys Trp Asn Thr Trp Ser
130 135 140

His Thr Gln Val Asp Thr Lys Ala Val Lys Glu Leu Val Asp Gln Val
145 150 155 160

His Lys Ser Gly Ala Val Ala Met Leu Tyr Asn Met Ile Ser Ala Asp
165 170 175

Ser Asn Pro Lys Asn Pro Ala Leu Pro Leu Ala Ala Leu Ala Tyr Asn
180 185 190

Phe Tyr Asp Ser Phe Gly Lys Lys Gly Glu Pro Met Thr Tyr Thr Ile
195 200 205

Gly Asp Asn Pro Thr Gln Val Tyr Tyr Asp Pro Ala Asn Pro Asp Trp
210 215 220

Gln Lys Tyr Ile Ala Gly Val Met Lys Ser Ala Met Asp Arg Met Gly
225 230 235 240

Phe Asp Gly Trp Gln Gly Asp Thr Ile Gly Asp Asn Arg Val Thr Asp
245 250 255

Tyr Glu His Arg Asn Ser Thr Asp Glu Ala Asp Ser His Met Met Ser
260 265 270

Asp Ser Tyr Ala Ser Phe Ile Asn Ala Met Lys Asp Leu Ile Gly Glu
275 280 285

Lys Tyr Tyr Ile Thr Ile Asn Asp Val Asn Gly Gly Asn Asp Asp Lys
290 295 300

Leu Ala Lys Ala Arg Gln Asp Val Val Tyr Asn Glu Leu Trp Thr Asn
305 310 315 320

Gly Gly Ser Val Ile Pro Gly Arg Met Gln Val Ala Tyr Gly Asp Leu
325 330 335

Lys Ala Arg Ile Asp Met Val Arg Asn Lys Thr Gly Lys Ser Leu Ile
340 345 350

Val Gly Ala Tyr Met Glu Glu Pro Gly Ile Asp Tyr Thr Val Pro Gly
355 360 365

Gly Lys Ala Thr Asn Gly Ala Gly Lys Asp Ala Leu Ala Gly Lys Pro
370 375 380

Leu Gln Ala Asp Ala Thr Leu Leu Val Asp Ala Thr Val Ala Ala Ala
385 390 395 400

Gly Gly Tyr His Met Ser Ile Ala Ala Leu Ala Asn Ala Asn Ala Ala
405 410 415

Leu Asn Val Leu Gln Ser Ala Tyr Tyr Pro Thr Gln Tyr Leu Ser Val
420 425 430

Ala Lys Asp Thr Ile Arg Lys Leu Tyr Asn Tyr Gln Gln Phe Ile Thr
435 440 445

Ala Tyr Glu Asn Leu Leu Arg Gly Glu Gly Val Thr Asn Ser Thr Gln
450 455 460

Ala Val Ser Thr Lys Asn Ala Ser Gly Glu Ile Leu Ser Lys Asp Ala
465 470 475 480

Leu Gly Val Thr Gly Asp Gln Val Trp Thr Phe Ala Lys Ser Gly Lys
485 490 495

Gly Phe Ser Thr Val Gln Met Ile Asn Met Met Gly Ile Asn Ala Gly
500 505 510

Trp His Asn Glu Glu Gly Tyr Ala Asp Asn Lys Thr Pro Asp Ala Gln
515 520 525

Glu Asn Leu Thr Val Arg Leu Ser Leu Ala Gly Lys Thr Ala Gln Glu
530 535 540

Ala Ala Lys Ile Ala Asp Gln Val Tyr Val Thr Ser Pro Asp Asp Trp
545 550 555 560

Ala Thr Ser Ser Met Lys Lys Ala Gln Ala Ser Leu Glu Thr Asp Glu
565 570 575

Asn Gly Gln Pro Val Leu Val Ile Ser Val Pro Lys Leu Thr Leu Trp
580 585 590

Asn Met Leu Tyr Ile Lys Glu Asp Thr Thr Ala Thr Pro Val Glu Pro
595 600 605

Val Thr Asn Gln Ala Gly Lys Lys Val Asp Asn Thr Val Thr Ser Glu
610 615 620

Ala Ser Ser Glu Thr Ala Lys Ser Glu Asn Thr Thr Val Asn Lys Gly
625 630 635 640

Ser Glu Ala Pro Thr Asp Thr Lys Pro Ser Val Glu Ala Pro Lys Leu
645 650 655

Asp Glu Thr Thr Lys Pro Ala Pro Ser Val Asp Glu Leu Val Asn Ser
660 665 670

Ala Ala Val Pro Val Ala Ile Ala Val Ser Glu Thr Ala His Asp Lys
675 680 685

Lys Asp Asp Asn Ser Val Ser Asn Thr Asp Gln Gly Thr Val Ala Ser
690 695 700

Asp Ser Ile Thr Thr Pro Ala Ser Glu Ala Ala Ser Thr Ala Ala Ser
705 710 715 720

Thr Val Ser Ser Glu Val Ser Glu Ser Val Thr Val Ser Ser Glu Pro
725 730 735

Ser Glu Thr Glu Asn Ser Ser Glu Ala Ser Thr Ser Glu Ser Ala Thr
740 745 750

Pro Thr Thr Thr Ala Ile Ser Glu Ser His Ala Val Val Glu Pro Val
755 760 765

Ala Ser Leu Thr Glu Ser Glu Ser Gln Ala Ser Thr Ser Leu Val Ser
770 775 780

Glu Thr Thr Ser Thr Ile Val Ser Val Ala Pro Ser Glu Val Ser Glu
785 790 795 800

Ser Thr Ser Glu Glu Val Ile Leu Met Asp Tyr Gln Lys Thr Ser Ile
805 810 815

Val Gly Ile Asp Ser Leu
820

<210> 10 
<211> 2220 
<212> DNA 

<213> Trichoderma longibrachiatum 
<400> 10 





aaggttagcc aagaacaata gccgataaag atagcctcat taaacggaat gagctagtag 60
gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct catgctctcc 120
ccatctactc atcaactcag atcctccagg agacttgtac accatctttt gaggcacaga 180
aacccaatag tcaaccgcgg actggcatca tgtatcggaa gttggccgtc atcacggcct 240
tcttggccac agctcgtgct cagtcggcct gcactctcca atcggagact cacccgcctc 300
tgacatggca gaaatgctcg tctggtggca cttgcactca acagacaggc tccgtggtca 360
tcgacgccaa ctggcgctgg actcacgcta cgaacagcag cacgaactgc tacgatggca 420
acacttggag ctcgacccta tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg 480
acggtgccgc ctacgcgtcc acgtacggag ttaccacgag cggtaacagc ctctccattg 540
gctttgtcac ccagtctgcg cagaagaacg ttggcgctcg cctttacctt atggcgagcg 600
acacgaccta ccaggaattc accctgcttg gcaacgagtt ctctttcgat gttgatgttt 660
cgcagctgcc gtaagtgact taccatgaac ccctgacgta tcttcttgtg ggctcccagc 720
tgactggcca atttaaggtg cggcttgaac ggagctctct acttcgtgtc catggacgcg 780
gatggtggcg tgagcaagta tcccaccaac aacgctggcg ccaagtacgg cacggggtac 840
tgtgacagcc agtgtccccg cgatctgaag ttcatcaatg gccaggccaa cgttgagggc 900
tgggagccgt catccaacaa cgcaaacacg ggcattggag gacacggaag ctgctgctct 960
gagatggata tctgggaggc caactccatc tccgaggctc ttacccccca cccttgcacg 1020
actgtcggcc aggagatctg cgagggtgat gggtgcggcg gaacttactc cgataacaga 1080
tatggcggca cttgcgatcc cgatggctgc gactggaacc cataccgcct gggcaacacc 1140
agcttctacg gccctggctc aagctttacc ctcgatacca ccaagaaatt gaccgttgtc 1200
acccagttcg agacgtcggg tgccatcaac cgatactatg tccagaatgg cgtcactttc 1260
cagcagccca acgccgagct tggtagttac tctggcaacg agctcaacga tgattactgc 1320
acagctgagg agacagaatt cggcggatct ctttctcaga caagggcggc ctgactcagt 1380
tcaagaaggc tacctctggc ggcatggttc tggtcatgag tctgtgggat gatgtgagtt 1440
tgatggacaa acatgcgcgt tgacaaagag tcaagcagct gactgagatg ttacagtact 1500
acgccaacat gctgtggctg gactccacct acccgacaaa cgagacctcc tccacacccg 1560
gtgccgtgcg cggaagctgc tccaccagct ccggtgtccc tgctcaggtc gaatctcagt 1620
ctcccaacgc caaggtcacc ttctccaaca tcaagttcgg acccattggc agcaccggca 1680
accctagcgg cggcaaccct cccggcggaa accgtggcac caccaccacc cgccgcccag 1740
ccactaccac tggaagctct cccggaccta cccagtctca ctacggccag tgcggcggta 1800
ttggctacag cggccccacg gtctgcgcca gcggcacaac ttgccaggtc ctgaaccctt 1860
actactctca gtgcctgtaa agctccgtgc gaaagcctga cgcaccggta gattcttggt 1920
gagcccgtat catgacggcg gcgggagcta catggccccg ggtgatttat tttttttgta 1980
tctacttctg acccttttca aatatacggt caactcatct ttcactggag atgcggcctg 2040
cttggtattg cgatgttgtc agcttggcaa attgtggctt tcgaaaacac aaaacgattc 2100
cttagtagcc atgcatttta agataacgga atagaagaaa gaggaaatta aaaaaaaaaa 2160
aaaaacaaac atcccgttca taacccgtag aatcgccgct cttcgtgtat cccagtacca 2220



<210> 11
<211> 1263
<212> DNA
<213> Phanerochaete chrysosporium
<220>
<221> CDS
<222> (34)..(1152)
<223> ckg4 ligninase precursor



<400> 11 



gctacagctc accgtccggt ctcagcagca gca atg gcg ttc aag cag ctc ctc 54
Met Ala Phe Lys Gln Leu Leu
1 5

gca gcc ctc tcc gtc gcc ctg acc ctc cag gtc acc caa gct gcc ccg 102
Ala Ala Leu Ser Val Ala Leu Thr Leu Gln Val Thr Gln Ala Ala Pro
10 15 20

aac ctc gac aag cgc gtc gct tgc ccc gac ggc gtg cac acc gcc tcc 150
Asn Leu Asp Lys Arg Val Ala Cys Pro Asp Gly Val His Thr Ala Ser
25 30 35

aac gcg gcg tgc tgt gca tgg ttc ccg gtc ctc gat gat atc cag cag 198
Asn Ala Ala Cys Cys Ala Trp Phe Pro Val Leu Asp Asp Ile Gln Gln
40 45 50 55

aac ctc ttc cac ggt ggc cag tgc ggt gcc gag gcc cac gag gcc ctt 246
Asn Leu Phe His Gly Gly Gln Cys Gly Ala Glu Ala His Glu Ala Leu
60 65 70

cgt atg gtc ttc cac gac tcc atc gct atc tcg ccc aag ctt cag tcg 294
Arg Met Val Phe His Asp Ser Ile Ala Ile Ser Pro Lys Leu Gln Ser
75 80 85

cag ggc aag ttt ggc ggc ggc ggc gcg gac ggc tcg atc att acc ttc 342
Gln Gly Lys Phe Gly Gly Gly Gly Ala Asp Gly Ser Ile Ile Thr Phe
90 95 100

tcc tcg atc gag acc acg tac cac ccg aac atc ggc ctc gac gag gtc 390
Ser Ser Ile Glu Thr Thr Tyr His Pro Asn Ile Gly Leu Asp Glu Val
105 110 115

gtc gcc atc cag aag ccg ttc atc gcg aag cac ggc gtc acc cgt ggc 438
Val Ala Ile Gln Lys Pro Phe Ile Ala Lys His Gly Val Thr Arg Gly
120 125 130 135

gac ttc atc gca ttc gct ggt gcc gtc ggc gtg agc aac tgc ccg ggc 486
Asp Phe Ile Ala Phe Ala Gly Ala Val Gly Val Ser Asn Cys Pro Gly
140 145 150

gcg ccg cag atg cag ttc ttc ctt ggc cgc ccc gag gca acg cag gcc 534
Ala Pro Gln Met Gln Phe Phe Leu Gly Arg Pro Glu Ala Thr Gln Ala
155 160 165

gcc ccc gac ggt ctc gtg ccc gag ccc ttc cac acc atc gat cag gtt 582
Ala Pro Asp Gly Leu Val Pro Glu Pro Phe His Thr Ile Asp Gln Val
170 175 180

ctc gct cgc atg ctt gac gct ggt ggc ttc gac gag atc gag act gtc 630
Leu Ala Arg Met Leu Asp Ala Gly Gly Phe Asp Glu Ile Glu Thr Val
185 190 195

tgg ctg ctc tct gcc cac tcc atc gcg gct gcg aac gac gtc gac ccg 678
Trp Leu Leu Ser Ala His Ser Ile Ala Ala Ala Asn Asp Val Asp Pro
200 205 210 215

acc atc tcc ggc ctg ccg ttc gac tcc act ccc ggc cag ttc gac tcc 726
Thr Ile Ser Gly Leu Pro Phe Asp Ser Thr Pro Gly Gln Phe Asp Ser
220 225 230

cag ttc ttc gtc gag acg cag ctc cgc ggt acc gca ttc cct ggc aag 774
Gln Phe Phe Val Glu Thr Gln Leu Arg Gly Thr Ala Phe Pro Gly Lys
235 240 245

act ggt atc cag ggc acc gtc atg tcc ccg ctc aag ggc gag atg cgt 822
Thr Gly Ile Gln Gly Thr Val Met Ser Pro Leu Lys Gly Glu Met Arg
250 255 260

ctg cag acg gac cac ttg ttc gcg cgt gac tcg cgc acg gca tgc gag 870
Leu Gln Thr Asp His Leu Phe Ala Arg Asp Ser Arg Thr Ala Cys Glu
265 270 275

tgg cag tcc ttc gtc aac aac cag acg aag ctg cag gag gac ttc cag 918
Trp Gln Ser Phe Val Asn Asn Gln Thr Lys Leu Gln Glu Asp Phe Gln
280 285 290 295

ttc atc ttc acg gcg ctc tcg acg ctc ggc cac gac atg aac gcc atg 966
Phe Ile Phe Thr Ala Leu Ser Thr Leu Gly His Asp Met Asn Ala Met
300 305 310

atc gac tgc tcc gag gtc atc ccc gcg ccc aag ccc gtc aac ttc ggc 1014
Ile Asp Cys Ser Glu Val Ile Pro Ala Pro Lys Pro Val Asn Phe Gly
315 320 325

ccg tcg ttc ttc ccc gcc ggt aag acg cac gcc gac atc gag cag gcc 1062
Pro Ser Phe Phe Pro Ala Gly Lys Thr His Ala Asp Ile Glu Gln Ala
330 335 340

tgc gca tcc acg ccg ttc ccg acg ctc atc acc gcc ccc ggt ccc tct 1110
Cys Ala Ser Thr Pro Phe Pro Thr Leu Ile Thr Ala Pro Gly Pro Ser
345 350 355

gcg tcc gtc gct cgc atc ccc ccg ccg ccg tcc ccc aac taa 1152
Ala Ser Val Ala Arg Ile Pro Pro Pro Pro Ser Pro Asn
360 365 370

gctatgtcta tgctggacat gctctcggtt ctacctcgtc ggtatcgtcg cacggttatc 1212
tcgcgtttgc atcatgtata cctgctcgtg gaatatacaa agtggtctat c 1263

<210> 12 
<211> 372 
<212> PRT 

<213> Phanerochaete chrysosporium 
<400> 12 

Met Ala Phe Lys Gln Leu Leu Ala Ala Leu Ser Val Ala Leu Thr Leu
1 5 10 15

Gln Val Thr Gln Ala Ala Pro Asn Leu Asp Lys Arg Val Ala Cys Pro
20 25 30

Asp Gly Val His Thr Ala Ser Asn Ala Ala Cys Cys Ala Trp Phe Pro
35 40 45

Val Leu Asp Asp Ile Gln Gln Asn Leu Phe His Gly Gly Gln Cys Gly
50 55 60

Ala Glu Ala His Glu Ala Leu Arg Met Val Phe His Asp Ser Ile Ala
65 70 75 80

Ile Ser Pro Lys Leu Gln Ser Gln Gly Lys Phe Gly Gly Gly Gly Ala
85 90 95

Asp Gly Ser Ile Ile Thr Phe Ser Ser Ile Glu Thr Thr Tyr His Pro
100 105 110

Asn Ile Gly Leu Asp Glu Val Val Ala Ile Gln Lys Pro Phe Ile Ala
115 120 125

Lys His Gly Val Thr Arg Gly Asp Phe Ile Ala Phe Ala Gly Ala Val
130 135 140

Gly Val Ser Asn Cys Pro Gly Ala Pro Gln Met Gln Phe Phe Leu Gly
145 150 155 160

Arg Pro Glu Ala Thr Gln Ala Ala Pro Asp Gly Leu Val Pro Glu Pro
165 170 175

Phe His Thr Ile Asp Gln Val Leu Ala Arg Met Leu Asp Ala Gly Gly
180 185 190

Phe Asp Glu Ile Glu Thr Val Trp Leu Leu Ser Ala His Ser Ile Ala
195 200 205

Ala Ala Asn Asp Val Asp Pro Thr Ile Ser Gly Leu Pro Phe Asp Ser
210 215 220

Thr Pro Gly Gln Phe Asp Ser Gln Phe Phe Val Glu Thr Gln Leu Arg
225 230 235 240

Gly Thr Ala Phe Pro Gly Lys Thr Gly Ile Gln Gly Thr Val Met Ser
245 250 255

Pro Leu Lys Gly Glu Met Arg Leu Gln Thr Asp His Leu Phe Ala Arg
260 265 270

Asp Ser Arg Thr Ala Cys Glu Trp Gln Ser Phe Val Asn Asn Gln Thr
275 280 285

Lys Leu Gln Glu Asp Phe Gln Phe Ile Phe Thr Ala Leu Ser Thr Leu
290 295 300

Gly His Asp Met Asn Ala Met Ile Asp Cys Ser Glu Val Ile Pro Ala
305 310 315 320

Pro Lys Pro Val Asn Phe Gly Pro Ser Phe Phe Pro Ala Gly Lys Thr
325 330 335

His Ala Asp Ile Glu Gln Ala Cys Ala Ser Thr Pro Phe Pro Thr Leu
340 345 350

Ile Thr Ala Pro Gly Pro Ser Ala Ser Val Ala Arg Ile Pro Pro Pro
355 360 365

Pro Ser Pro Asn
370



<210> 13
<211> 1285
<212> DNA
<213> Phanerochaete chrysosporium
<220>
<221> CDS
<222> (34)..(1149)
<223> CKG5 ligninase precursor
<400> 13

gtcagactct ccaacggttg cctttggaca gac atg gcc ttc aag aag ctc ctt 54
Met Ala Phe Lys Lys Leu Leu
1 5

250 255 260

gac ttc ctg atc gct cgt gac gcg cgc acc gcc tgc gag tgg cag tcg 870
Asp Phe Leu Ile Ala Arg Asp Ala Arg Thr Ala Cys Glu Trp Gln Ser
265 270 275

ttc gtc aac aac cag tcc aag ctc gtc tcc gac ttc caa ttc atc ttc 918
Phe Val Asn Asn Gln Ser Lys Leu Val Ser Asp Phe Gln Phe Ile Phe
280 285 290 295

ctc gcc ctc act cag ctc ggc cag gac ccg gat gcg atg acc gac tgc 966
Leu Ala Leu Thr Gln Leu Gly Gln Asp Pro Asp Ala Met Thr Asp Cys
300 305 310

tct gct gtc atc ccc atc tcc aag ccc gcc ccg aac aac acc ccc gga 1014
Ser Ala Val Ile Pro Ile Ser Lys Pro Ala Pro Asn Asn Thr Pro Gly
315 320 325

ttc tcc ttc ttc ccg ccc ggc atg acg atg gac gat gtc gag cag gct 1062
Phe Ser Phe Phe Pro Pro Gly Met Thr Met Asp Asp Val Glu Gln Ala
330 335 340

tgc gcc gag acg ccc ttc ccg act ctc tcg act ctc cct ggc ccc gcg 1110
Cys Ala Glu Thr Pro Phe Pro Thr Leu Ser Thr Leu Pro Gly Pro Ala
345 350 355

acc tcc gtc gct cgc atc cct cct cct cct ggt gct taa gcagccatca 1159
Thr Ser Val Ala Arg Ile Pro Pro Pro Pro Gly Ala
360 365 370

gacttcggat cacaccccgg tattggcaac ggaaatttag aacgaagatc gtccagtgtt 1219
ttgaagtaga aatgtgcttg tactgtgtaa acagctcttt tgacgaaata cactctgatt 1279
tcgtcg 1285



<210> 14
<211> 371
<212> PRT
<213> Phanerochaete chrysosporium
<400> 14

Met Ala Phe Lys Lys Leu Leu Ala Val Leu Thr Ala Ala Leu Ser Leu
1 5 10 15

Arg Ala Ala Gln Gly Ala Ala Val Glu Lys Arg Ala Thr Cys Ser Asn
20 25 30

Gly Lys Val Val Pro Ala Ala Ser Cys Cys Thr Trp Phe Asn Val Leu
35 40 45

Ser Asp Ile Gln Glu Asn Leu Phe Asn Gly Gly Gln Cys Gly Ala Glu
50 55 60

Ala His Glu Ser Ile Arg Leu Val Phe His Asp Ala Ile Ala Ile Ser
65 70 75 80

Pro Ala Met Glu Pro Gln Ala Ser Ser Val Arg Gly Ala Asp Gly Ser
85 90 95

Ile Met Ile Phe Asp Glu Ile Glu Thr Asn Phe His Pro Asn Ile Gly
100 105 110

Leu Asp Glu Ile Val Arg Leu Gln Lys Pro Phe Val Gln Lys His Gly
115 120 125

Val Thr Pro Gly Asp Phe Ile Ala Phe Ala Gly Ala Val Ala Leu Ser
130 135 140

Asn Cys Pro Gly Ala Pro Gln Met Asn Phe Phe Thr Gly Arg Ala Pro
145 150 155 160

Ala Thr Gln Pro Ala Pro Asp Gly Leu Val Pro Glu Pro Phe His Ser
165 170 175

Val Asp Gln Ile Ile Asp Arg Val Phe Asp Ala Gly Glu Phe Asp Glu
180 185 190

Leu Glu Leu Val Trp Met Leu Ser Ala His Ser Val Ala Ala Ala Asn
195 200 205

Asp Ile Asp Pro Asn Ile Gln Gly Leu Pro Phe Asp Ser Thr Pro Gly
210 215 220

Ile Phe Asp Ser Gln Phe Phe Val Glu Thr Gln Leu Ala Gly Thr Gly
225 230 235 240

Phe Thr Gly Gly Ser Asn Asn Gln Gly Glu Val Ser Ser Pro Leu Pro
245 250 255

Gly Glu Met Arg Leu Gln Ser Asp Phe Leu Ile Ala Arg Asp Ala Arg
260 265 270

Thr Ala Cys Glu Trp Gln Ser Phe Val Asn Asn Gln Ser Lys Leu Val
275 280 285

Ser Asp Phe Gln Phe Ile Phe Leu Ala Leu Thr Gln Leu Gly Gln Asp
290 295 300

Pro Asp Ala Met Thr Asp Cys Ser Ala Val Ile Pro Ile Ser Lys Pro
305 310 315 320

Ala Pro Asn Asn Thr Pro Gly Phe Ser Phe Phe Pro Pro Gly Met Thr
325 330 335

Met Asp Asp Val Glu Gln Ala Cys Ala Glu Thr Pro Phe Pro Thr Leu
340 345 350

Ser Thr Leu Pro Gly Pro Ala Thr Ser Val Ala Arg Ile Pro Pro Pro
355 360 365

Pro Gly Ala
370












<210> 15 
<211> 360 
<212> DNA 

<213> Solanum tuberosum 










<400> 15 














tgaccctaga cttgtccatc ttctggattg gccaagttaa ttaatgtatg aaataaaagg 60
atgcacacat agtgacatgc taatcactat aatgtgggca tcaaagttgt gtgttatgtg 120


taataactaa 


ttatctgaat 


aagagaaaga 


gagatcatcc 


_4__.i-4_.i__4-.l__ 


tcctaaatga


180 


atgacagtgt ctttataatt ctttgatgaa cagatgcatt ttattaacca attccatata 240
catataaata ttaatcatat ataattaata tcaattggtt agcaaaaccc aaatctagtc 300
taggtgtgtt ttgctaatta tgggggatag agcaaaaaag aaactaacgt ctcaagaatc 360


<210> 16 
<211> 2521 
<212> DNA 

<213> Agrobacterium tumefaciens 










<220> 

<221> CDS 

<222> (585)..(1826)

<223> nopaline synthetase 










<400> 16 














tagccgaccc 


agacgagcca 


agggatcttt 


ttggaatgct 


octccatcat 


V— * CL \J \-2 ^— * L-» ' ► 


60 


gacgtttggg 


tggttgaaca 


gaagtcatta 


tcgtacggaa 


w y \»» ^— ci a u ca 


■\- c c ct nnnrt 
— - » — v — ay y y y 


120 


aaccctgtgg 


ttggcatgca 


catacaaatg 


gacgaacgga 






180 


taaatatccg ttattctaat aaacgctctt ttctcttagg tttacccgcc aatatatcct 240
gtcaaacact gatagtttaa actgaaggcg ggaaacgaca atctgatcat gagcggagaa 300
ttaagggagt cacgttatga cccccgccga tgacgcggga caagccgttt tacgtttgga 360
actgacagaa ccgcaacgat tgaaggagcc actcagccgc gggtttctgg agtttaatga 420
gctaagcaca tacgtcagaa accattattg cgcgttcaaa agtcgcctaa ggtcactatc 480
agctagcaaa tatttcttgt caaaaatgct ccactgacgt tccataaatt cccctcggta 540
tccaattaga gtctcatatt cactctcaat ccaaataatc tgca atg gca att acc 596
Met Ala Ile Thr

245 250 255 260

gcc gag aag ggc ttt ttc ttt tat ggt gaa gga tcc aac act tac gtt 1412
Ala Glu Lys Gly Phe Phe Phe Tyr Gly Glu Gly Ser Asn Thr Tyr Val
265 270 275

tgc aac gtc caa gag caa ata gac cac gaa cgc cgg aag gtt gcc gca 1460
Cys Asn Val Gln Glu Gln Ile Asp His Glu Arg Arg Lys Val Ala Ala
280 285 290

gcg tgt gga ttg cgt ctc aat tct ctc ttg cag gaa tgc aat gat gaa 1508
Ala Cys Gly Leu Arg Leu Asn Ser Leu Leu Gln Glu Cys Asn Asp Glu
295 300 305

tat gat act gac tat gaa act ttg agg gaa tac tgc cta gca ccg tca 1556
Tyr Asp Thr Asp Tyr Glu Thr Leu Arg Glu Tyr Cys Leu Ala Pro Ser
310 315 320

cct cat aac gtg cat cat gca tgc cct gac aac atg gaa cat cgc tat 1604
Pro His Asn Val His His Ala Cys Pro Asp Asn Met Glu His Arg Tyr
325 330 335 340

ttt tct gaa gaa tta tgc tcg ttg gag gat gtc gcg gca att gca gct 1652
Phe Ser Glu Glu Leu Cys Ser Leu Glu Asp Val Ala Ala Ile Ala Ala
345 350 355

att gcc aac atc gaa cta ccc ctc acg cat gca ttc atc aat att att 1700
Ile Ala Asn Ile Glu Leu Pro Leu Thr His Ala Phe Ile Asn Ile Ile
360 365 370

cat gcg ggg aaa ggc aag att aat cca act ggc aaa tca tcc agc gtg 1748
His Ala Gly Lys Gly Lys Ile Asn Pro Thr Gly Lys Ser Ser Ser Val
375 380 385

att ggt aac ttc agt tcc agc gac ttg att cgt ttt ggt gct acc cac 1796
Ile Gly Asn Phe Ser Ser Ser Asp Leu Ile Arg Phe Gly Ala Thr His
390 395 400

gtt ttc aat aag gac gag atg gtg gag taa agaaggagtg cgtcgaagca 1846
Val Phe Asn Lys Asp Glu Met Val Glu
405 410



gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc cggtcttgcg 1906
atgattatca tataatttct gttgaattac gttaagcatg taataattaa catgtaatgc 1966
atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata catttaatac 2026
gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct 2086
atgttactag atcgatcaaa cttcggtact gtgtaatgac gatgagcaat cgagaggctg 2146
actaacaaaa ggtatgccca aaaacaacct ctccaaactg tttcgaattg gaagtttctg 2206
ctcatgccga caggcataac ttagatattc gcgggctatt cccactaatt cgtcctgctg 2266
gtttgcgcca agataaatca gtgcatctcc ttacaagttc ctctgtcttg tgaaatgaac 2326
tgctgactgc cccccaagaa agcctcctca tctcccagtt ggcggcggct gatacaccat 2386
cgaaaaccca cgtccgaaca cttgatacat gtgcctgaga aataggccta cgtccaagag 2446
caagtccttt ctgtgctcgt cggaaattcc tctcctgtca gacggtcgtg cgcatgtctt 2506
gcgttgatga agctt 2521



<210> 17 

<211> 413 

<212> PRT 

<213> Agrobacterium tumefaciens 

<400> 17 

Met Ala Ile Thr Leu Ser Ala Thr Ser Leu Pro Ile Ser Ala Ala Asp
1 5 10 15

His His Pro Leu Pro Leu Thr Val Gly Val Leu Gly Ser Gly His Ala
20 25 30

Gly Thr Ala Leu Ala Ala Trp Phe Ala Ser Arg His Val Pro Thr Ala
35 40 45

Leu Trp Ala Pro Ala Asp His Pro Gly Ser Ile Ser Ala Ile Lys Ala
50 55 60

Asn Glu Gly Val Ile Thr Thr Glu Gly Met Ile Asn Gly Pro Phe Arg
65 70 75 80

Val Ser Ala Cys Asp Asp Leu Ala Ala Val Ile Arg Ser Ser Arg Val
85 90 95

Leu Ile Ile Val Thr Arg Ala Asp Val His Asp Ser Phe Val Asn Glu
100 105 110

Leu Ala Asn Phe Asn Gly Glu Leu Ala Thr Lys Asp Ile Val Val Val
115 120 125

Cys Gly His Gly Phe Ser Ile Lys Tyr Glu Arg Gln Leu Arg Phe Lys
130 135 140

Arg Ile Phe Glu Thr Asp Asn Ser Pro Ile Thr Ser Lys Leu Ser Asp
145 150 155 160

Gln Lys Lys Cys Asn Val Asn Ile Lys Glu Met Lys Ala Ser Phe Gly
165 170 175

Leu Ser Cys Phe Pro Ile His Arg Asp Asp Ala Gly Val Ile Asp Leu
180 185 190

Pro Glu Asp Thr Lys Asn Ile Phe Ala Gln Leu Phe Ser Ala Arg Ile
195 200 205

Ile Cys Ile Pro Pro Leu Gln Val Leu Phe Phe Ser Asn Cys Ile Thr
210 215 220

His Ala Val Pro Ala Val Met Asn Ile Gly Arg Leu Arg Asp Pro Ala
225 230 235 240

Asn Ser Leu Thr Lys Arg Ala Glu Lys Trp Leu Leu Glu Leu Asp Glu
245 250 255

Arg Thr Pro Arg Ala Glu Lys Gly Phe Phe Phe Tyr Gly Glu Gly Ser
260 265 270

Asn Thr Tyr Val Cys Asn Val Gln Glu Gln Ile Asp His Glu Arg Arg
275 280 285

Lys Val Ala Ala Ala Cys Gly Leu Arg Leu Asn Ser Leu Leu Gln Glu
290 295 300

Cys Asn Asp Glu Tyr Asp Thr Asp Tyr Glu Thr Leu Arg Glu Tyr Cys
305 310 315 320

Leu Ala Pro Ser Pro His Asn Val His His Ala Cys Pro Asp Asn Met
325 330 335

Glu His Arg Tyr Phe Ser Glu Glu Leu Cys Ser Leu Glu Asp Val Ala
340 345 350

Ala Ile Ala Ala Ile Ala Asn Ile Glu Leu Pro Leu Thr His Ala Phe
355 360 365

Ile Asn Ile Ile His Ala Gly Lys Gly Lys Ile Asn Pro Thr Gly Lys
370 375 380

Ser Ser Ser Val Ile Gly Asn Phe Ser Ser Ser Asp Leu Ile Arg Phe
385 390 395 400

Gly Ala Thr His Val Phe Asn Lys Asp Glu Met Val Glu
405 410

<210> 18 
<211> 835 
<212> DNA 

<213> Streptomyces hygroscopicus 
<400> 18 

gctcgctgtc attttcgaga cgccatcttt ggaagcggtg gccgaatccg tactgcgcgg 60 
actcgacgac gcgtaaaacg atcgaccacg tacacgagtc cggacacggg gcgaggaggc 120 
ccggttccgg caccgaggaa gaccgaagga agaccacacg tgagcccaga acgacgcccg 180 
gccgacatcc gccgtgccac cgaggcggac atgccggcgg tctgcaccat cgtcaaccac 240 



ta ca t ccfa era 


caacrca ccrcrt 

V-* d d \-A CI ^— " VA \-4 *— 


caacttccat 


a cccsa^cscccsc 


a cjcr a a c ca c a 

CA U \J CA CA ^— ' \J w CA 


VA CA u l*> U U C& V„x \J 


*3nn 

ouu 


n a rnA pphm 

U CI N-4 CI Vprf* W U 


K \w- ^— ' U ^ C-r U CJ 


U CA U ^ CI w 


ccct~cicici~cct 

^ V»J U \^ ^ \-A 


t ccf ceo a cj crt 


rrrra can enact 

y y ca y ca y 


oou 


y L.L.y UL-Ljy v_~ c* 


■f - /— ■ /-T f +" PI iti*"* 

I— l^r i—- L- d 


prrTrTrT"fP*'t~rTrT 

y y y ^— - V-* V-» V— - <— y V_J 


q ca y y t_>ci y \w- ca 


a rnrrfa r*na 

ca ^--y ^— • V— - CA Ca 


- uy yd i y y i_ i 


4 p n 

*i ^1 L/ 


y ciy Luy duuy 


•f- fT -f- /— • /"T t~ r~* I - c 
L>U l>0 »— » <— 


c*c*CT*c*cit~*c*zi p* 


y ci y ^> y y ct y y 


ya^L,yyyv^uu 




d r n 

" o u 


f*f* f~* T~ /"T 

dL-u^ciuuuyt 


l~y dd y I— 1 ■ I—- 1 L» 


ct ct a ct ct c* a pad 
yydyyi^-di— .dy 


y y i — l, L_ L^ct ct y ct 


Pf P'Pfl - rffT't - fTT f* 

y • y i— y y ucy u 


4- "t - f* a +- r^ft ft ft 

L.y i_ t_d Luy yy 




f 4- n c c c ^ pn 


ci t y y y ^- 


uuy c- y \— d 


y ct y y \_.y v n y 


y ci l_ ct L- y o ^ o 


y L.yy La l. y 




ctgcgggcgg ccggcttcaa gcacgggaac tggcatgacg tgggtttctg gcagctggac 660
ttcagcctgc cggtaccgcc ccgtccggtc ctgcccgtca ccgagatctg aacggagtgc 720
gcgtgggcat cgcccgagtt ggagctggta cgggaactca tcgaactcaa ctggcatacc 780


cgcaatggtg 


aggtggaacc 


ft f* ft ft fft ft a r" 

y *— y y (_-y y d lu 


ft f*ffi~ o /~*iTo f* f+ 

y uy u ct (— y cill. 


ft^ ft f* f* f* ^ ft ft a 
y uy t_-i_-(_-ciyyci 


ggcct


a ^ r 


<210> 19 

<211> 623 

<212> DNA 

<213> Oryza sativa 












<400> 19 














tatatacata 


cccccccctc 


t - fT~* "t~ r~« f~*r^ ^ 1~ c 


w v^- Ca Ca y^r 


L- Ca i_,c«. v i — Ca 


^* p*p"a p"p"A p"<™*a 
d * — d * — i — d i — > — d 


60 


cctcctcccc 


cctcgctgcc 


^4 CA C* w U CL U 


ctcctccccc 


ctccccctcc 


y w wy vy v-^- w y 


120 


gtaaccaccc 


cgcgtccctc 


tcctctttct 


ttctccattt 


tttttttcca 

C- C^ C^ C^ 1— 1 — C— C-* 


1~r , 1~ r*rr1 - r , 't - ccr 

* — V^- C- I—- V — \_ V — 


180 


atctttggcc


ttggtagttt 


pfpfpfpfpfp , pf;aj rr ^ 
yyyyyv-^yciyc* 


rrnrnnpht" p*rr 
y y v y y o i_ u-v^-y 


1~ P*fTP*P*P*^ Pf^i 1~ 
i>uy L>ouay a L. 


p« /~t -f- ft e~* ft c* ci ct 
oy y V— y v y o y y 




gaggggcggg


atctcgcggc 


l. y y y 


n i~ ci c* c\ n c c c\ 
u*y y * — y y ^ — y 


y ci l, <w» \ — * — y ^ — y 


CtCTCI 3 i~ CfCt CfCf 

y y y d d u y y y y 


O \J \J 


ctctcggatg


tagatctgat 


-y t_.y l uy u. 


t - n ft n rx n ^ rt t* 


yaL-yyyyuy l. 


I— laaaaLL. L.O 




gccatgctaa


acaagatcag 


ydciycAyyyyci 


ca ca c* y y y ^* l- 


a*t~rrrrt"'t _ 1~a1~a 

CA UU U L L La L. CA 


ttttta ta t~a 


420 


tttctgctgc tgctcgtcag gcttagatgt gctagatctt tctttcttct ttttgtgggt 480
agaatttgaa tccctcagca ttgttcatcg gtagtttttc ttttcatgat ttgtgacaaa 540
tgcagcctcg tgcggagctt ttttgtaggt agaagatggc tgacgccgag gatgggggat 600
ccccgggtgg tcagtccctt atg 623










<210> 20 
<211> 1506 
<212> DNA 

<213> Trichoderma longibrachiatum 








<400> 20 














accatggctc aatctgcttg tactcttcaa tctgagactc atcctccact tacttggcag 60
aagtgttcat ctggtggtac ttgtactcaa cagactggat ctgttgttat tgatgctaac 120
tggagatgga ctcatgctac taactcttct actaactgct atgatggtaa cacttggtca 180
tctactcttt gtcctgataa cgagacttgt gctaagaact gctgtcttga tggtgctgct 240
tacgcttcta cttacggagt tactacttct ggaaactctc tttctattgg attcgttact 300
cagtctgctc agaagaacgt tggtgctagg ttgtacttga tggcttctga tactacttac 360
caagagttca ctcttcttgg taacgagttc tctttcgatg ttgatgtttc tcaacttcca 420
tgtggtttga acggagcttt gtacttcgtt tctatggatg ctgatggtgg agtttctaag 480
tatccaacta acactgctgg agctaagtat ggtactggtt actgtgattc tcagtgtcca 540
agagatctta agttcattaa cggacaagct aatgttgagg gatgggagcc atcttctaac 600
aatgctaaca ctggtattgg aggtcatgga tcttgttgct ctgagatgga tatttgggag 660
gctaactcta tctctgaggc tcttactcca catccatgca ctactgttgg acaagagatt 720
tgcgagggag atggttgtgg tggaacttac tctgataaca gatacggagg tacttgcgat 780
ccagatggat gtgattggaa tccatacaga cttggtaaca cttctttcta cggtccagga 840
tcttcattca ctcttgatac tactaagaag ttgactgttg ttactcagtt cgagacttct 900
ggtgctatca acagatacta cgttcagaat ggagttactt tccaacaacc taacgctgag 960
cttggttctt actctggtaa cgagttgaac gatgattact gtactgctga ggaagctgag 1020
tttggtggat catctttctc tgataagggt ggacttactc agttcaagaa ggctacttct 1080


ggtggtatgg ttcttgttat gtctctttgg gatgattact acgctaacat gttgtggctt 1140


gattctactt acccaactaa cgagacttct tctactccag gtgctgttag aggatcttgc 1200
tctacttctt ctggtgttcc tgctcaagtt gaatctcaat ctcctaatgc taaggttact 1260
ttctctaaca tcaagttcgg accaattgga tctactggta acccttctgg aggtaatcca 1320
cctggaggta atccacctgg aactactaca actaggagac cagctactac aactggatca 1380
tctccaggac ctactcaatc tcattacggt caatgtggag gtattggtta ctctggtcca 1440
actgtttgtg cttctggaac tacttgtcaa gttcttaacc cttactattc tcaatgcctt 1500
taatga 1506



<210> 21 

<211> 26 

<212> DNA 

<213> synthetic DNA 

<220> 